Loss Recovery at the ATM Layer for Latency-constrained Reliable Multicast
Abstract
Many multicast applications have stringent time constraints and require a data delivery guarantee. Existing reliable multicast protocols all operate at the transport layer. A single cell discarded by a switch causes the loss of the entire packet, and the whole packet must eventually be retransmitted. A small congestion problem can therefore escalate into a more serious one, which in turn prolongs data recovery. The problem can be worse for multicast, since many copies of the same packet are often retransmitted unnecessarily and reach receivers that did not request the packet. This is called the exposure problem. Recovery latency would be lower if only the data actually lost were retransmitted, rather than the entire packet. To reduce unnecessary retransmission, and thus the recovery latency of reliable multicast applications, we propose a reliable multicast protocol in which loss detection and retransmission are performed at the ATM layer instead of the transport layer. The unit of recovery is the ATM cell rather than the transport-layer packet: only missing cells are retransmitted. The proposed protocol uses the ATM physical tree structure for local recovery at the ATM layer. It is thus scalable and offers low recovery latency. In addition, retransmission operates at the cell level with minimal exposure, so it does not aggravate existing network conditions. The built-in message stability detection scheme can be used as a rate control mechanism to help alleviate congestion and therefore reduce recovery latency. The protocol is particularly beneficial to large-scale reliable multicast applications with stringent latency requirements, such as distribution of financial data, distance learning, and Internet conferencing. We have run experiments to compare the performance of ATM-layer recovery and transport-layer recovery. Simulation results show that, with ATM-layer recovery, the average retransmission delay is significantly lower and the connection throughput is higher under congestion.
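The gap between packet-level and cell-level recovery described in the abstract can be illustrated with a little arithmetic. The sketch below is not the authors' protocol. It only counts how many cells each recovery unit would resend after a loss, assuming AAL5 framing (48-byte cell payload, 8-byte trailer), which the abstract does not specify. The function names are hypothetical.

```python
import math

# Assumption: packets are carried over AAL5; the abstract does not name the adaptation layer.
CELL_PAYLOAD_BYTES = 48   # payload carried by one 53-byte ATM cell
AAL5_TRAILER_BYTES = 8    # AAL5 CPCS-PDU trailer appended to each packet


def cells_per_packet(packet_bytes: int) -> int:
    """Number of ATM cells needed to carry one packet, including padding."""
    return math.ceil((packet_bytes + AAL5_TRAILER_BYTES) / CELL_PAYLOAD_BYTES)


def retransmitted_cells(packet_bytes: int, lost_cells: int, cell_level: bool) -> int:
    """Cells resent after `lost_cells` cells of one packet are dropped.

    cell_level=True  -> only the missing cells are resent (ATM-layer recovery).
    cell_level=False -> the whole packet is resent (transport-layer recovery).
    """
    total = cells_per_packet(packet_bytes)
    if not 0 <= lost_cells <= total:
        raise ValueError("lost_cells must be between 0 and the packet's cell count")
    if lost_cells == 0:
        return 0
    return lost_cells if cell_level else total


if __name__ == "__main__":
    size = 9180  # example packet size in bytes, chosen for illustration
    print(cells_per_packet(size))
    print(retransmitted_cells(size, lost_cells=1, cell_level=False))
    print(retransmitted_cells(size, lost_cells=1, cell_level=True))
```

Under these assumptions, a single dropped cell costs one retransmitted cell with cell-level recovery, but every cell of the packet with packet-level recovery. In a multicast tree that cost is multiplied by the number of branches the repair traverses.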
Similar resources
Cache Partitioning for Multiple Sessions in Local Loss Recovery of Reliable Multicast
Local loss recovery for reliable multicast can provide significant performance improvement in terms of loss recovery latency, bandwidth consumption and network throughput. Active Reliable Multicast (ARM) is a novel loss recovery scheme for large-scale reliable multicast. In ARM, local loss recovery is realized by retrieving repairs from nearby active routers. In practice, active resources are limite...
Towards Scaleable Error Control for Reliable Multipoint Services in ATM Networks
Advanced applications, such as distributed multimedia applications, require efficient communication subsystems providing a variety of services. Existing communication systems face increasing difficulties in fulfilling these requirements. In particular, the efficient provision of reliable group communication services in ATM-Networks remains a major unresolved issue. This paper presents a novel f...
Adaptable Error Control for Efficient Provision of Reliable Services in ATM Networks
Distributed computing, distributed multimedia applications as well as advanced applications for computer-supported cooperative work depend on high performance networks providing point-to-point and point-to-multipoint communication services. ATM networks promise to provide adequate support for these applications. However, the traffic originating from these applications is highly bursty and unpredi...
Discus: Lateral Error Correction for Time-Critical Multicast
Discus is a low-latency reliable multicast protocol designed for time-critical clustered applications. It uses IP Multicast to transmit data and recovers from packet loss in end-hosts using Lateral Error Correction (LEC), a novel repair mechanism involving the exchange of XOR repairs between receivers and across groups. In datacenters and clusters, application needs frequently dictate large num...
Performance Analysis of Loss Recovery Latency in Reliable Multicast Protocols using Active Parity Encoded Services
Providing an efficient and reliable multicast for data dissemination applications on a large scale is a challenge, especially when the applications require a very short delivery delay and high throughput. The combination of a local recovery approach based on active services with those using FEC/ARQ gives rise to a new class of reliable multicast protocols called APES "Active Parity Encoding Ser...